智能论文笔记

Towards deep observation: A systematic survey on artificial intelligence techniques to monitor fetus via Ultrasound Images

Mahmood Alzubaidi , Marco Agus , Khalid Alyafei , Khaled A Althelaya , Uzair Shah , Alaa Abd-Alrazaq , Mohammed Anbar , Michel Makhlouf , Mowafa Househ

分类：机器学习 | 计算机视觉

2022-01-17

开发旨在增强胎儿监测的创新信息学方法是生殖医学研究的新领域。已经对人工智能（AI）技术进行了几项评论，以改善妊娠结局。他们的限制是专注于特定数据，例如怀孕期间母亲的护理。这项系统的调查旨在探讨人工智能（AI）如何通过超声（US）图像帮助胎儿生长监测。我们使用了八个医学和计算机科学书目数据库，包括PubMed，Embase，Psycinfo，ScienceDirect，IEEE Explore，ACM图书馆，Google Scholar和Web of Science。我们检索了2010年至2021年之间发表的研究。从研究中提取的数据是使用叙述方法合成的。在1269项检索研究中，我们包括了107项与调查中有关该主题的查询的不同研究。我们发现，与3D和4D超声图像（n = 19）相比，2D超声图像更受欢迎（n = 88）。分类是最常用的方法（n = 42），其次是分割（n = 31），与分割（n = 16）集成的分类和其他其他杂项，例如对象检测，回归和增强学习（n = 18）。妊娠结构域中最常见的区域是胎儿头（n = 43），然后是胎儿（n = 31），胎儿心脏（n = 13），胎儿腹部（n = 10），最后是胎儿的面孔（n = 10）。在最近的研究中，深度学习技术主要使用（n = 81），其次是机器学习（n = 16），人工神经网络（n = 7）和增强学习（n = 2）。 AI技术在预测胎儿疾病和鉴定怀孕期间胎儿解剖结构中起着至关重要的作用。需要进行更多的研究来从医生的角度验证这项技术，例如试点研究和有关AI及其在医院环境中的应用的随机对照试验。

translated by 谷歌翻译

Quantum-Inspired Tensor Neural Networks for Option Pricing

Raj G. Patel , Chia-Wei Hsing , Serkan Sahin , Samuel Palmer , Saeed S. Jahromi , Shivam Sharma , Tomas Dominguez , Kris Tziritas , Christophe Michel , Vincent Porte

分类：机器学习

2022-12-28

Recent advances in deep learning have enabled us to address the curse of dimensionality (COD) by solving problems in higher dimensions. A subset of such approaches of addressing the COD has led us to solving high-dimensional PDEs. This has resulted in opening doors to solving a variety of real-world problems ranging from mathematical finance to stochastic control for industrial applications. Although feasible, these deep learning methods are still constrained by training time and memory. Tackling these shortcomings, Tensor Neural Networks (TNN) demonstrate that they can provide significant parameter savings while attaining the same accuracy as compared to the classical Dense Neural Network (DNN). In addition, we also show how TNN can be trained faster than DNN for the same accuracy. Besides TNN, we also introduce Tensor Network Initializer (TNN Init), a weight initialization scheme that leads to faster convergence with smaller variance for an equivalent parameter count as compared to a DNN. We benchmark TNN and TNN Init by applying them to solve the parabolic PDE associated with the Heston model, which is widely used in financial pricing theory.

translated by 谷歌翻译

Automatic Text Simplification of News Articles in the Context of Public Broadcasting

Diego Maupomé , Fanny Rancourt , Thomas Soulas , Alexandre Lachance , Marie-Jean Meurs , Desislava Aleksandrova , Olivier Brochu Dufour , Igor Pontes , Rémi Cardon , Michel Simard

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-26

This report summarizes the work carried out by the authors during the Twelfth Montreal Industrial Problem Solving Workshop, held at Universit\'e de Montr\'eal in August 2022. The team tackled a problem submitted by CBC/Radio-Canada on the theme of Automatic Text Simplification (ATS).

translated by 谷歌翻译

Semi-supervised GAN for Bladder Tissue Classification in Multi-Domain Endoscopic Images

Jorge F. Lazo , Benoit Rosa , Michele Catellani , Matteo Fontana , Francesco A. Mistretta , Gennaro Musi , Ottavio de Cobelli , Michel de Mathelin , Elena De Momi

分类：计算机视觉 | 机器学习

2022-12-21

Objective: Accurate visual classification of bladder tissue during Trans-Urethral Resection of Bladder Tumor (TURBT) procedures is essential to improve early cancer diagnosis and treatment. During TURBT interventions, White Light Imaging (WLI) and Narrow Band Imaging (NBI) techniques are used for lesion detection. Each imaging technique provides diverse visual information that allows clinicians to identify and classify cancerous lesions. Computer vision methods that use both imaging techniques could improve endoscopic diagnosis. We address the challenge of tissue classification when annotations are available only in one domain, in our case WLI, and the endoscopic images correspond to an unpaired dataset, i.e. there is no exact equivalent for every image in both NBI and WLI domains. Method: We propose a semi-surprised Generative Adversarial Network (GAN)-based method composed of three main components: a teacher network trained on the labeled WLI data; a cycle-consistency GAN to perform unpaired image-to-image translation, and a multi-input student network. To ensure the quality of the synthetic images generated by the proposed GAN we perform a detailed quantitative, and qualitative analysis with the help of specialists. Conclusion: The overall average classification accuracy, precision, and recall obtained with the proposed method for tissue classification are 0.90, 0.88, and 0.89 respectively, while the same metrics obtained in the unlabeled domain (NBI) are 0.92, 0.64, and 0.94 respectively. The quality of the generated images is reliable enough to deceive specialists. Significance: This study shows the potential of using semi-supervised GAN-based classification to improve bladder tissue classification when annotations are limited in multi-domain data.

translated by 谷歌翻译

DIONYSUS: A Pre-trained Model for Low-Resource Dialogue Summarization

Yu Li , Baolin Peng , Pengcheng He , Michel Galley , Zhou Yu , Jianfeng Gao

分类：自然语言处理

2022-12-20

Dialogue summarization has recently garnered significant attention due to its wide range of applications. However, existing methods for summarizing dialogues are suboptimal because they do not take into account the inherent structure of dialogue and rely heavily on labeled data, which can lead to poor performance in new domains. In this work, we propose DIONYSUS (dynamic input optimization in pre-training for dialogue summarization), a pre-trained encoder-decoder model for summarizing dialogues in any new domain. To pre-train DIONYSUS, we create two pseudo summaries for each dialogue example: one is produced by a fine-tuned summarization model, and the other is a collection of dialogue turns that convey important information. We then choose one of these pseudo summaries based on the difference in information distribution across different types of dialogues. This selected pseudo summary serves as the objective for pre-training DIONYSUS using a self-supervised approach on a large dialogue corpus. Our experiments show that DIONYSUS outperforms existing methods on six datasets, as demonstrated by its ROUGE scores in zero-shot and few-shot settings.

translated by 谷歌翻译

Enhancing Task Bot Engagement with Synthesized Open-Domain Dialog

Miaoran Li , Baolin Peng , Michel Galley , Jianfeng Gao , Zhu Zhang

分类：自然语言处理 | 人工智能

2022-12-20

Many efforts have been made to construct dialog systems for different types of conversations, such as task-oriented dialog (TOD) and open-domain dialog (ODD). To better mimic human-level conversations that usually fuse various dialog modes, it is essential to build a system that can effectively handle both TOD and ODD and access different knowledge sources. To address the lack of available data for the fused task, we propose a framework for automatically generating dialogues that combine knowledge-grounded ODDs and TODs in various settings. Additionally, we introduce a unified model PivotBot that is capable of appropriately adopting TOD and ODD modes and accessing different knowledge sources in order to effectively tackle the fused task. Evaluation results demonstrate the superior ability of the proposed model to switch seamlessly between TOD and ODD tasks.

translated by 谷歌翻译

Objaverse: A Universe of Annotated 3D Objects

Matt Deitke , Dustin Schwenk , Jordi Salvador , Luca Weihs , Oscar Michel , Eli VanderBilt , Ludwig Schmidt , Kiana Ehsani , Aniruddha Kembhavi , Ali Farhadi

分类：计算机视觉 | 人工智能 | 机器人

2022-12-15

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and LAION have propelled recent dramatic progress in AI. Large neural models trained on such datasets produce impressive results and top many of today's benchmarks. A notable omission within this family of large-scale datasets is 3D data. Despite considerable interest and potential applications in 3D vision, datasets of high-fidelity 3D models continue to be mid-sized with limited diversity of object categories. Addressing this gap, we present Objaverse 1.0, a large dataset of objects with 800K+ (and growing) 3D models with descriptive captions, tags, and animations. Objaverse improves upon present day 3D repositories in terms of scale, number of categories, and in the visual diversity of instances within a category. We demonstrate the large potential of Objaverse via four diverse applications: training generative 3D models, improving tail category segmentation on the LVIS benchmark, training open-vocabulary object-navigation models for Embodied AI, and creating a new benchmark for robustness analysis of vision models. Objaverse can open new directions for research and enable new applications across the field of AI.

translated by 谷歌翻译

Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving

Angad Singh , Omar Makhlouf , Maximilian Igl , Joao Messias , Arnaud Doucet , Shimon Whiteson

分类：机器人 | 机器学习

2022-12-14

Multi-object state estimation is a fundamental problem for robotic applications where a robot must interact with other moving objects. Typically, other objects' relevant state features are not directly observable, and must instead be inferred from observations. Particle filtering can perform such inference given approximate transition and observation models. However, these models are often unknown a priori, yielding a difficult parameter estimation problem since observations jointly carry transition and observation noise. In this work, we consider learning maximum-likelihood parameters using particle methods. Recent methods addressing this problem typically differentiate through time in a particle filter, which requires workarounds to the non-differentiable resampling step, that yield biased or high variance gradient estimates. By contrast, we exploit Fisher's identity to obtain a particle-based approximation of the score function (the gradient of the log likelihood) that yields a low variance estimate while only requiring stepwise differentiation through the transition and observation models. We apply our method to real data collected from autonomous vehicles (AVs) and show that it learns better models than existing techniques and is more stable in training, yielding an effective smoother for tracking the trajectories of vehicles around an AV.

translated by 谷歌翻译

Financial Risk Management on a Neutral Atom Quantum Processor

Lucas Leclerc , Luis Ortiz-Guitierrez , Sebastian Grijalva , Boris Albrecht , Julia R. K. Cline , Vincent E. Elfving , Adrien Signoles , Loïc Henriet , Gianni Del Bimbo , Usman Ayub Sheikh

分类：机器学习

2022-12-06

Machine Learning models capable of handling the large datasets collected in the financial world can often become black boxes expensive to run. The quantum computing paradigm suggests new optimization techniques, that combined with classical algorithms, may deliver competitive, faster and more interpretable models. In this work we propose a quantum-enhanced machine learning solution for the prediction of credit rating downgrades, also known as fallen-angels forecasting in the financial risk management field. We implement this solution on a neutral atom Quantum Processing Unit with up to 60 qubits on a real-life dataset. We report competitive performances against the state-of-the-art Random Forest benchmark whilst our model achieves better interpretability and comparable training times. We examine how to improve performance in the near-term validating our ideas with Tensor Networks-based numerical simulations.

translated by 谷歌翻译

Grounded Keys-to-Text Generation: Towards Factual Open-Ended Generation

Faeze Brahman , Baolin Peng , Michel Galley , Sudha Rao , Bill Dolan , Snigdha Chaturvedi , Jianfeng Gao

分类：自然语言处理

2022-12-04

Large pre-trained language models have recently enabled open-ended generation frameworks (e.g., prompt-to-text NLG) to tackle a variety of tasks going beyond the traditional data-to-text generation. While this framework is more general, it is under-specified and often leads to a lack of controllability restricting their real-world usage. We propose a new grounded keys-to-text generation task: the task is to generate a factual description about an entity given a set of guiding keys, and grounding passages. To address this task, we introduce a new dataset, called EntDeGen. Inspired by recent QA-based evaluation measures, we propose an automatic metric, MAFE, for factual correctness of generated descriptions. Our EntDescriptor model is equipped with strong rankers to fetch helpful passages and generate entity descriptions. Experimental result shows a good correlation (60.14) between our proposed metric and human judgments of factuality. Our rankers significantly improved the factual correctness of generated descriptions (15.95% and 34.51% relative gains in recall and precision). Finally, our ablation study highlights the benefit of combining keys and groundings.

translated by 谷歌翻译